Learning to Search on Manifolds for 3D Pose Estimation of Articulated Objects
نویسندگان
چکیده
This paper focuses on the challenging problem of 3D pose estimation of a diverse spectrum of articulated objects from single depth images. A novel structured prediction approach is considered, where 3D poses are represented as skeletal models that naturally operate on manifolds. Given an input depth image, the problem of predicting the most proper articulation of underlying skeletal model is thus formulated as sequentially searching for the optimal skeletal configuration. This is subsequently addressed by convolutional neural nets trained end-to-end to render sequential prediction of the joint locations as regressing a set of tangent vectors of the underlying manifolds. Our approach is examined on various articulated objects including human hand, mouse, and fish benchmark datasets. Empirically it is shown to deliver highly competitive performance with respect to the state-of-the-arts, while operating in real-time (over 30 FPS).
منابع مشابه
3D Pose Estimation, Tracking and Model Learning of Articulated Objects from Dense Depth Video using Projected Texture Stereo
Service robots deployed in domestic environments generally need the capability to deal with articulated objects such as doors and drawers in order to fulfill certain mobile manipulation tasks. This however, requires, that the robots are able to perceive articulated furniture objects such as cupboards, dishwashers and cabinets. In this paper, we present an approach for detecting, tracking, and l...
متن کاملParametrized SOMs for Object Recognition and Pose Estimation
We present the “Parametrized Self-Organizing Map” (PSOM) as a method for 3D object recognition and pose estimation. The PSOM can be seen as a continuous extension of the standard SelfOrganizing Map which generalizes the discrete set of reference vectors to a continuous manifold. In the context of visual learning, manifolds based on PSOMs can be used to represent the appearance of various object...
متن کاملContinuous - state Graphical Models for Object Localization , Pose Estimation and Tracking
of “Continuous-state Graphical Models for Object Localization, Pose Estimation and Tracking” by Leonid Sigal, Ph.D., Brown University, May 2008. Reasoning about pose and motion of objects, based on images or video, is an important task for many machine vision applications. Estimating the pose of articulated objects such as people and animals is particularly challenging due to the complexity of ...
متن کاملA generalized parametric 3D shape representation for articulated pose estimation
We present a novel parametric 3D shape representation, Generalized sum of Gaussians (G-SoG), which is particularly suitable for pose estimation of articulated objects. Compared with the original sum-of-Gaussians (SoG), G-SoG can handle both isotropic and anisotropic Gaussians, leading to a more flexible and adaptable shape representation yet with much fewer anisotropic Gaussians involved. An ar...
متن کاملShape Models of the Human Body for Distributed Inference
of “Shape Models of the Human Body for Distributed Inference” by Silvia Zuffi, Ph.D., Brown University, May 2015 In this thesis we address the problem of building shape models of the human body, in 2D and 3D, which are realistic and efficient to use. We focus our efforts on the human body, which is highly articulated and has interesting shape variations, but the approaches we present here can b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1612.00596 شماره
صفحات -
تاریخ انتشار 2016